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We prove that interactive learning based classical realizability (introduced by Aschieri and Berardi for 
first order arithmetic LU) is sound with respect to Coquand game semantics. In particular, any realizer 
of an implication-and-negation-free arithmetical formula embodies a winning recursive strategy for 
the 1-Backtracking version of Tarski games. We also give examples of realizer and winning strategy 
extraction for some classical proofs. We also sketch some ongoing work about how to extend our 
notion of realizability in order to obtain completeness with respect to Coquand semantics, when it is 
restricted to 1-Backtracking games. 

1 Introduction 

In this paper we show that learning based realizability (see Aschieri and Berardi Q) relates to 1- 
Backtracking Tarski games as intuitionistic realizability (see Kleene HI) relates to Tarski games. It 
is well know that Tarski games (see, definition [T2]below) are just a simple way of rephrasing the concept 
of classical truth in terms of a game between two players - the first one trying to show the truth of a 
formula, the second its falsehood - and that an intuitionistic realizer gives a winning recursive strategy to 
the first player. The result is quite expected: since a realizer gives a way of computing all the information 
about the truth of a formula, the player trying to prove the truth of that formula has a recursive winning 
sti^ategy. However, not at all any classically provable arithmetical formula allows a winning recursive 
strategy for that player; otherwise, the decidability of the Halting problem would follow. In ||5|, Co- 
quand introduced a game semantics for Peano Arithmetic such that, for any provable formula A, the first 
player has a recursive winning strategy, coming from the proof of A. The key idea of that remarkable 
result is to modify Tarski games, allowing players to correct their mistakes and backtrack to a previ- 
ous position. Here we show that learning based realizers have direct interpretation as winning recursive 
strategies in 1-Backtracking Tarski games (which are a particular case of Coquand games see Q and 
definition [TT] below). The result, again, is expected: interactive learning based realizers, by design, are 
similar to strategies in games with backtracking: they improve their computational ability by learning 
from interaction and counterexamples in a convergent way; eventually, they gather enough information 
about the truth of a formula to win the game. 

An interesting step towards our result was the Hayashi realizability fT^. Indeed, a realizer in the sense 
of Hayashi represents a recursive winning strategy in 1-Backtracking games. However, from the com- 
putational point of view, realizers do not relate to 1-Backtracking games in a significant way: Hayashi 
winning strategies work by exhaustive search and, actually, do not learn from the game and from the 
interaction with the other player. As a result of this issue, constructive upper bounds on the length of 
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games cannot be obtained, whereas using our realizability it is possible. For example, in the case of the 
1 -Backtracking Tarski game for the formula 3xyyf{x) < f{y), the Hayashi realizer checks all the natu- 
ral numbers until an n such that Vy/(«) < f{y) is found; on the contrary, our realizer yields a strategy 
which bounds the number of backtrackings by /(O), as shown in this paper. In this case, the Hayashi 
strategy is the same one suggested by the classical truth of the formula, but instead one is interested in 
the constructive strategy suggested by its classical proof. 

Since learning based realizers are extracted from proofs in HA + EMi (Heyting Arithmetic with ex- 
cluded middle over existential sentences, see [1]), one also has an interpretation of classical proofs as 
learning strategies. Moreover, studying learning based realizers in terms of 1 -Backtracking games also 
sheds light on their behaviour and offers an interesting case study in program extraction and interpreta- 
tion in classical arithmetic. 

The plan of the paper is the following. In section ^ we recall the calculus of realizers and the 
main notion of interactive learning based realizability. In section ^ we prove our main theorem: a 
realizer of an arithmetical formula embodies a winning strategy in its associated 1 -Backtracking Tarski 
game. In section ^ we extract realizers from two classical proofs and study their behavior as learning 
strategies. In section ^ we define an extension of our realizability and formulate a conjecture about its 
completeness with respect to 1 -Backtracking Tarski games. 

2 The Calculus Tc,,, and Learning-Based Realizability 

The whole content of this section is based on Aschieri and Berardi ([Tl, where the reader may also find 
fuU motivations and proofs. We recall here the definitions and the results we need in the rest of the paper. 
The winning strategies for 1 -Backtracking Tarski games will be represented by terms of Tdass (see li ll). 
Tciass is a system of typed lambda calculus which extends Godel's system T by adding symbols for non 
computable functions and a new type S (denoting a set of states of knowledge) together with two basic 
operations over it. The terms of Tcu,ss are computed with respect to a state of knowledge, which represents 
a finite approximation of the non computable functions used in the system. 

For a complete definition of T we refer to Girard |6|. T is simply typed A -calculus, with atomic 
types N (representing the set N of natural numbers) and Bool (representing the set B = {True, False} of 
booleans), product types T xU and arrows types T ^U, constants : N, S : N — )• N, True, False : Bool, 
pairs (.,.), projections no,ni, conditional if^ and primitive recursion in all types, and the usual 
reduction rules (j8), (tt), (if ), (R) for A, (.,.), if 7-, Rj-. From now on, if t,u are terms of T with t = u'we 
denote provable equality in T. If k £ N, the numeral denoting k is the closed normal term S''{0) of type 
N. All closed normal terms of type N are numerals. Any closed normal term of type Bool in T is True 
or False. 

We introduce a notation for ternary projections: if T = A x (B x C), with pQ,pi,p2 we respectively 
denote the terms ttq, Ax : T.no{7ii{x)), Xx : T.ni{7ii{x)). If m = {uq, (mi,M2)) '■ T, then pju = ui in T for 
/ = 0, 1,2. We abbreviate (mq, {u\,U2)) '■ T with (mo,mi,M2) '■ T. 

Definition 1 (States of Knowledge and Consistent Union) 1. A k-ary predicate of J is any closed 
normal term P ^ Bool o/T. 

2. An atom is any triple {P,n,m), where P is a {k+ \)-ary predicate ofT, and n,m are {k+ 1) 
numerals, and Pnm = True in T. 

3. Two atoms {P,n,m), {P' ,n',m') are consistent ifP = P' and n = n' in T imply m = m'. 

4. A state of knowledge, shortly a state, is any finite set S ofpairwise consistent atoms. 
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5. Two states Si , ^2 are consistent if Si U ^2 is a state. 

6. S is the set of all states of knowledge. 

7. The consistent union 5i '^52 of Si , 52 G S w 5i U 52 G S minus all atoms ofS2 which are inconsistent 
with some atom of Si. 

For each state of knowledge 5 we assume having a unique constant 5 denoting it; if there is no ambiguity, 
we just assume that state constants are strings of the form {{P,ni,mi),..., {P,nk,mk)}, denoting a state of 
knowledge. We define with Tg = T + S + {5|5 G S} the extension of T with one atomic type S denoting 
S, and a constant 5 : S for each 5 G S, and no new reduction rule. Computation on states will be defined 
by a set of algebraic reduction rules we call "functional". 

Definition 2 (Functional set of rules) Let C be any set of constants, each one of some type Ai ^ . . . ^ 
An A, for some Ai , . . . , A„, A G {Bool, N, S}. We say that M is a functional set of reduction rules for 
C if ^ consists, for all c EC and all closed normal terms ai :Ai,...,a„ :A„ o/Ts, of exactly one rule 
cai ...an^ a, where a: A is a closed normal term o/Tg. 

We define two extensions of Tg: an extension Taass with symbols denoting non-computable maps Xp : 
N* Bool, Op : N*^ — > N (for each k-ary predicate P of T) and no computable reduction rules, another 
extension T^^^,„, with the computable approximations Xp, 0p of *J*p» and a computable set of reduction 
rules. Xp and Op are intended to represent respectively the oracle mapping n to the truth value of BxPnx, 
and a Skolem function mapping n to an element m such that ^xPnx holds iff Prim = True. We use 
the elements of Tdass to represent non-computable realizers, and the elements of TLeam to represent a 
computable "approximation" of a reaUzer. We denote terms of type S by p,p', . . .. 

Definition 3 Assume P : N*^"*"^ — > Bool is a k + l-ary predicate ofT. We introduce the following con- 
stants: 

L ;^p : S ^ Bool and (pp : S ^ N'' N. 

2. Xp : n'^ ^ Bool and Op : # ^ N. 

3. lyj : S S -> S (we denote ypiP2 with pi lyj p2j. 

4. Addp : N*=+' S and addp : S ^ ^ S. 
L Eg is the set of all constants Xpi^Pj ^,addp. 
2. E is the set of all constants Xp, Op, ^,Addp. 

3- Class = Tg + ^. 

4. A term t G Taass has state if it has no state constant different from 0. 

Let 7= ti ...tk- We interpret ;^p5Fand (jOp^F respectively as a "guess" for the values of the oracle and the 
Skolem map Xp and Op for 3y.PTy, guess computed w.r.t. the knowledge state denoted by the constant 
s. There is no set of computable reduction rules for the constants Op,Xp G S, and therefore no set 
of computable reduction rules for Tckss- If PbP2 denotes the states 5i,52 G S, we interpret pi lyj p2 as 
denoting the consistent union 5i 52 of 5i , 52- Addp denotes the map constantly equal to the empty state 
0. addp5nm denotes the empty state if we cannot add the atom (P, n, m) to 5, either because (P, n,m') eS 
for some numeral m', or because Pnm = False. addp5nm denotes the state {{P,n,m)} otherwise. We 
define a system TLeam with reduction rules over Sg by a functional reduction set ^g. 

Definition 4 (The System TLeam) Let s,si,S2be state constants denoting the states 5, 5i , 52. Let {P,n,m) 
be an atom, is the following junctional set of reduction rules for Sg; 
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1. If {P, n, m) G 5, then Xpsn f-^ True and (ppsn t-^ m, else Xpsn h-> False and cppsn h-)- 0. 

2. Sl\ii}S2^Sl^_S2 

3. addpsnm ^ ^ if either {P,n,m') G S for some numeral m' or Pnm = False, and addpsnm 
{ {P, n, m) } otherwise. 

We define Tuam = Ts + Ss + ^s- 

Remark. TLeam is nothing but Ts with some "syntactic sugar". TLeam is strongly normalizing, has Church- 
Rosser property for closed term of atomic types and: 

Proposition 1 (Normal Form Property for TLeam) Assume A is either an atomic type or a product type. 
Then any closed normal term t G T^^.^,,, of type A is: a numeral n : N, or a boolean True, False : Bool, 
or a state constant s :S, or a pair {u, v) : B xC. 

Definition 5 Assume t £ Tc^i and s is a state constant. We call "approximation oft at state s" the term 
t\s\ ofTuarn obtained from t by replacing each constant Xp with Xps, each constant Op with (f>ps, each 
constant Addp with addps. 

If s,s' are state constants denoting 5,5' G S, we write j <j' for5 C 5'. We say that a sequence {5^/}/eN 
of state constants is a weakly increasing chain of states (is w.i. for short), if s, < for all i G N. 

Definition 6 (Convergence) Assume that {5,}/gn is a w.i. sequence of state constants, and u,v G Tcto. 
/. u converges in {si}i^^ if3i G N.Vj > i.u[sj] = u[si] in T[^,„. 
2. u converges if u converges in every w.i. sequence of state constants. 

Our realizability semantics relies on two properties of the non computable terms of atomic type in Taass- 
First, if we repeatedly increase the knowledge state s, eventually the value of t [s] stops changing. Second, 
if t has type S, and contains no state constants but 0, then we may effectively find a way of increasing the 
knowledge state s such that eventually we have t[s\ = 0. 

Theorem 1 (Stability Theorem) Assume t G is a closed term of atomic type A (A G {Bool,N,S}j. 

Then t is convergent. 

Tlieorem 2 (Fixed Point Property) Let t : S be a closed term of Jciass of state 0, and s = S. Define 

t(5) = 5' ift[S] = 5', andf{S) = 5U t(5). 

1. For any « G N, define f^{S) = 5 and f+^iS) = f{f"iS)). There are /i G N, 5' G S such that 
S' = f^{S) D 5, /(50 = 5' and x{S') = 0. 

2. We may ejfectively find a state constant s' >s such that t[s'] = 0. 

Definition 7 (The language ^ of Peano Arithmetic) 1. The terms of are all / G T, such that 
t : N and FV{t) C {xj , . . . ,x^}for some xi, . . . 

2. The atomic formulas of ^ are all Qt\ . ..t„ gT, for some (2 : N" ^ Bool closed term of T, and 
some terms ?i, . . . 

3. The formulas of^ are built from atomic formulas of^ by the connectives V, A, — > V, 3 as usual. 

Definition 8 (Types for realizers) For each arithmetical formula A we define a type \A\ of J by induc- 
tion on A: \P{ti,...,tn)\ = S, |AAB| = \A\ x |AVB| = Bool x (|A| x \A ^ B\ = \A\ \B\, 
\ixA\=N^\A\, \3xA\=lix\A\ 
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We define now our notion of realizability, which is relativized to a knowledge state s, and differs from 
Kreisel modified realizability for a single detail: if we realize an atomic formula, the atomic formula 
does not need to be true, unless the realizer is equal to the empty set in s. 

Definition 9 (Realizability) Assume s is a state constant, t G Taass is a closed term of state 0, A € ^ is 
a closed formula, and t : \A\. Let ? = , . . . , f„ : N. 

1. f lll-j P(t) if and only ift[s\ = in T team implies P(t) = True 

2. t \\\-sAABifand only if llQt llhs- A and Kit !lhv B 

3. t IW-gAV B if and only if either pot[s] = True in Ti^,„,„ and pit Wt^A, or pQt[s\ = False in Ti^,„,„ and 
P2t\h,B 

4. t llhs- A — )■ B if and only if for all u, ifu A, then tu \\\-s B 

5. t VxA if and only if for all numerals n, tn A[n/x\ 

6. t Hhv 3xA if and only for some numeral n, Tl()t[s\ = n in Ti^„,„ and 7l[t \\\-sA[n/x] 
We define f llh A if and only ift 1 1 h^. A /or all state constants s. 

Theorem 3 If A is a closed formula provable in HA + EMi (see /[7]/j, then there exists t G Tams such that 
t llhA. 

3 Games, Learning and Realizability 

In this section, we define the notion of game, its 1 -Backtracking version andTarski games. We also prove 
our main theorem, connecting learning based realizability and 1 -Backtracking Tarski games. 

Definition 10 (Games) 1. A game G between two players is a quadruple {y,E\,E2.,W), where V is a 
set, E\,E2 are subsets ofV x V such that Dom{Ei) r\Dom{E2) = 0, where Dom{Ei) is the domain 
ofEi, and W is a set of sequences, possibly infinite, of elements ofV. The elements ofV are called 
positions of the game; E\, Ej are the transition relations respectively for player one and player 
two: (vi , V2) € Ei means that player i can legally move from the position vi to the position V2- 

2. We define a play to be a walk, possibly infinite, in the graph {V^Ei UE2), i.e. a sequence, possibly 
void, vi :: V2 ::...:: v„ . of elements ofV such that (v,-,v,+i) € E\ UE2for every i. A play of 
the form v\ :: V2 v„ ::.. . is said to start from vi. A play is said to be complete if it is either 
infinite or is equal to vi ... :: v„ and v„ ^ Dom{E[ □£'2). W is required to be a set of complete 
plays. If p is a complete play and p gW, we say that player one wins in p.Ifp is a complete play 
and p ^W, we say that player two wins in p. 

3. Let Pg be the set of finite plays. Consider a function f : Pq ^V; a play vi v„ . is said 
to be f -correct if f{v\ ,.. . ,v,) = v;+i for every i such that (v,-, v,_|_i) E E\ 

4. A winning strategy /rom position vfor player one is a function W.Pg^V such that every complete 
(0-correct play v :: vi v„ ::.. . belongs to W. 

Notation. If for / E N,/ = 1, . . . ,?i we have that pt = {pi)o ::...:: is a finite sequence of elements 
of length ni, with pi ::...:: p„ we denote the sequence 

{pi)o ::...:: {pi)n, {Pk)o {Pk)nk 
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where {pi)j denotes the 7-th element of the sequence pi. 

Suppose that ai :: ■■■ an is a play of a game G, representing, for some reason, a bad situation 
for player one (for example, in the game of chess, a„ might be a configuration of the chessboard in which 
player one has just lost his queen). Then, learnt the lesson, player one might wish to erase some of his 
moves and come back to the time the play was just, say, ai ,a2 and choose, say, b\ in place of a^; in other 
words, player one might wish to backtrack. Then, the game might go on as ai :: a2 ::b\ :: .. . :: b,n and, 
once again, player one might want to backtrack to, say, a\ :: a2 b\ :: ... :: bi, with / < m, and so on... 
As there is no learning without remembering, player one must keep in mind the errors made during the 
play. This is the idea of 1 -Backtracking games (for more motivations, we refer the reader to |4j and IS) 
and here is our definition. 

Definition 11 (1-Backtracking Games) Let G = {V,Ei,E2,W) be a game. 

1. We define \Back{G) as the game {Pq,E[,E2,W), where: 

2. Pq is the set of finite plays ofG 

3. E2 '■= {{p :: a, p :: a :: b) \ p,p :: a E Pq, {a,b) € £"2} and 

E[ := {{p :: a, p :: a b) \ p,p a £ Pq, {a,b) G ^i} U 

{{p a q :: d, p :: a) \ p,q £ Pg,P a :: q :: d £ Pea € Dom{Ei) 
d ^ Dom{E2),p :: a :: q :: d ^ W}; 

4. W' is the set of finite complete plays p\ :: ... :: p„ of {Pq,E[,E2) such that p„ € W. 

Note. The pair {p ::a::q ::d, pv. a) in the definition above of E2 codifies a backtracking move by player 
one (and we point out that q:: d might be the empty sequence). 

Remark. Differently from [4], in which both players are allowed to backtrack, we only consider the case 
in which only player one is supposed do that (as in ||7l). It is not that our results would not hold: clearly, 
the proofs in this paper would work just as fine for the definition of 1-Backtracking Tarski games given 
in [4J. However, as noted in [4], any player-one recursive winning strategy in our version of the game can 
be effectively transformed into a winning strategy for player one in the other version the game. Hence, 
adding backtracking for the second player does not increase the computational challenge for player one. 
Moreover, the notion of winner of the game given in f4\ is strictly non constructive and games played by 
player one with the correct winning strategy may even not terminate. Whereas, with our definition, we 
can formulate our main theorem as a program termination result: whatever the strategy chosen by player 
two, the game terminates with the win of player one. This is also the spirit of realizability and hence of 
this paper: the constructive information must be computed in a finite amount of time, not in the limit. 

In the well known Tarski games, there are two players and a formula on the board. The second player 
- usually called Abelard - tries to show that the formula is false, while the first player - usually called 
Eloise - tries to show that it is true. Let us see the definition. 

Definition 12 (Tarski Games) Let A be a closed implication and negation free arithmetical formula of 
.if. We define the Tarski game for A as the game Ta = {y,E\,E2,W), where: 
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1. V is the set of all subformula occurrences of A; that is, V is the smallest set of formulas such that, 
if either A \/ B or A AB belongs to V, then A,B £V; if either VxA(x) or 3xA{x) belongs to V, then 
A{n) € V for all numerals n. 

2. El is the set of pairs (Ai ,A2) (zV xV such that A[ = 3xA{x) and A2 = A{n), or Ai = A V B and 
either A2 = A or A2 = B; 

3. E2 is the set of pairs (Ai,A2) (zV xV such that A i = VxA(x) and A2 =A{n), or A\ =AAB and 
A2 = A or A2 =B; 

4. W is the set of finite complete plays Ai :: ... :: A„ such that An = True. 

Note. We stress that Tarski games are defined only for implication and negation free formulas. Indeed, 
\Back{TA), when A contains implications, would be much more involved and less intuitive (for a defini- 
tion of Tarski games for every arithmetical formula see for example Berardi fT\). 

What we want to show is that ift\\\-A,t gives to player one a recursive winning strategy in \Back{TA). 
The idea of the proof is the following. Suppose we play as player one. Our strategy is relativized to a 
knowledge state and we start the game by fixing the actual state of knowledge as 0. Then we play in the 
same way as we would do in the Tarski game. For example, if there is VxA(x) on the board and A{n) 
is chosen by player two, we recursively play the strategy given by tn; if there is 3xA(x) on the board, 
we calculate 7rof[0] = n and play A{n) and recursively the strategy given by n\t. If there is A VB on the 
board, we calculate /?o^[0]> and according as to whether it equals True or False, we play the strategy 
recursively given by p\t or p2t. If there is an atomic formula on the board, if it is true, we win; otherwise 
we extend the current state with the state lU) f [0] , we backtrack and play with respect to the new state of 
knowledge and trying to keep as close as possible to the previous game. Eventually, we will reach a state 
large enough to enable our realizer to give always correct answers and we will win. Let us consider first 
an example and then the formal definition of the winning strategy for Eloise. 

Example (EMi). Given a predicate P of T, and its boolean negation predicate -iP (which is repre- 
sentable in T), the realizer Ep of 

EMi := Vx. 3j P{x,y)y\/y^P{x,y) 

is defined as 

Xa^{Xpa, (Opa, 0), An/ Addpam) 

According to the rules of the game \Back{T^K^), Abelard is the first to move and, for some numeral n, 
chooses the formula 

3yP{n,y)y^y^P{n,y) 
Now is the turn of Eloise and she plays the strategy given by the term 

{Xpn, {<t>pa, 0), Xm^ Addpnm) 
Hence, she computes Xp?i[0] = ;t;p0n = False (by definition H]), so she plays the formula 

\/y^P{n,y) 
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and Abelard chooses m and plays 

-iP(?i,m) 

If -'P{n,m) = True, Eloise wins. Otherwise, she plays the strategy given by 

(Am" Addpam)m[0] = addpdnm = {{P,n,m)} 
So, the new knowledge state is now {{P,n,m)} and she backtracks to the formula 

3yP{n,y)vyy^P{n,y) 
Now, by definition |4j Xpn[{{P,n,m)}] = True and she plays the formula 

^yP{n,y) 

calculates the term 

7lo{<t>pn, (d)[{{P,n,m)}] = (pp{{P,n,m)}n = m 

plays P{n,m) and wins. 

Notation. In the following, we shall denote with upper case letters A,B,C closed arithmetical formulas, 
with lower case letters p,q,r plays of Ta and with upper case letters P,Q,R plays of \Back{TA) (and all 
those letters may be indexed by numbers). To avoid confusion with the plays of Ta, plays of IBack(rA) 
will be denoted as pi,...,pn rather than p\ : : ... : : Moreover, if P = qi, . . . ,qm, then P,pi, . . . ,pn will 
denote the sequence q\,. . . ,qm,pi, ■ ■ ■ Pn- 

Definition 13 Fix u such that u\\\- A. Let p be a finite play o/Ta starting with A. We define by induction 
on the length of p a term p{p) £ Tc/n,., (read as 'the realizer adapt to p') in the following way: 

1. If p =A, then p (p) = u. 

2. If p = (q :: 3xB{x) :: B{n)) and p{q :: 3xB{x)) = t, then p{p) = Tlit. 

3. If p = {q :: VxB(x) :: B{n)) and p{q :: VxB(x)) = t, then p{p) = tn. 

4. If p= (q:: BqABi :: B,) and p{q :: BqABi) = t, then p{p) = Hit. 

5. If p = {q:: Bi VB2 Bi) and p{q :: Bi V B2) = t, then p{p) = pjt. 
Given a play P = Q,q B of \Back{TA), we set p{P) = p{q B). 

Definition 14 Fix u such that u llh A. Let p be as in definition \13\and P be a finite play of \Back{TA) 
starting with A. We define by induction on the length of P a state (read as 'the state associated to 
P') in the following way: 

1. IfP=A,thenL{P) = 0. 

2. IfP = {Q,p::B,p::B::C) andL{Q,p B) = s, then L{P) = s. 

3. If P = {Q,p :: B :: q,p :: B) and T.{Q,p :: B :: q) = s and p{Q,p B :: q) = t, then if t : S, then 
r(P) = s^t[sl else L{P) = s. 

Definition 15 (Winning strategy for IBack(rA)) Fix u such that m llh A. Let p and £ be respectively as 
in definition \13\and\T4\ We define a function CO from the set of finite plays of \Back{TA) to set of finite 
plays ofTA,' CO is intended to be a recursive winning strategy from A for player one in \Back{TA). 
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1. Ifp{P,q :: 3xB{x)) = t, ^{P,q :: 3xB(x)) = s and {Hot)[s\ = n, then 

(0{P,q :: 3xB{x)) = q :: 3xB{x) :: B{n) 

2. Ifp{P,q ::B VC) =t and'L{P,q ■.■.ByC) = s, then if {pQt)[s\ = True then 

(0{P,q :: By C) = q By C B 

else 

(0{P,q::B\JC) = q::ByC::C 

3. If An is atomic, An = False, p(/',Ai ::•••:: A,,) = t and Y.{P,A\ ::•••:: A„) = s, then 

(0{PM :: ■■■■.-.An) =Ai :: ••• ::A,- 
where i is equal to the smallest j <n such that p{A[ :: • • • :: Ay) = w and either 
Aj = 3xC{x) A Ay+i = C{n) A (tTqw) [s VJ)t[s]] / n 

or 

Ay =Bi VB2 AAy+i =BiA{pow)[sWt[s]] = False 

or 

Aj=Bi\/B2AAj+i =B2A{pQw)[sWt[s]] = True 
If such j does not exist, we set i = n. 

4. In the other cases, (0{P,q) = q. 

Lemma 1 Suppose m llh A and p,L,(0 as in definition [75l Let Q be a finite co-correct play of \Back{TA) 
starting with A, p{Q) = t, Z{Q) = s. IfQ = Q' ,q' :: B, then t llh, B. 

Proof. By a straightforward induction on the length of Q. 

Theorem 4 (Soundness Theorem) Let A be a closed negation and implication free arithmetical for- 
mula. Suppose that u llh A and consider the game \Back{TA). Let (0 be as in definition [75l Then (0 is a 
recursive winning strategy from A for player one. 

Proof. The theorem will be proved in the full version of this paper. The idea is to prove it by contradiction, 
assuming there is an infinite G)-correct play. Then one can produce an increasing sequence of states. 
Using theorems [U and |2l one can show that Eloise's moves eventually stabilize and that the game results 
in a winnning position for Eloise. 

4 Examples 

Minimum Principle for functions over natural numbers. The minimum principle states that every 
function / over natural numbers has a minimum value, i.e. there exists an f{n) £ N such that for every 
HI G N f{m) > f{n). We can prove this principle in HA + EMi, for any / in the language. We assume 
P{y^x) = f{x) < y, but, in order to enhance readability, we will write f{x) < y rather than the obscure 
P{y,x). We define: 
Lessef{n) := 3a/(a) < n 
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Lessf{n) := 3a/(a) < n 
Notlessf{n) := \/af{a) > n 

Then we formulate - in equivalent form - the minimum principle as: 

Hasminf := 3y. Notlessf{y) ALessef{y) 

The informal argument goes as follows. As base case of the induction, we just observe that f{k) < 0, 
implies / has a minimum value (i.e. f{k)). Afterwards, if Notlessf{f(0)), we are done, we have find 
the minimum. Otherwise, Lessf{f{0)), and hence /(a) < /(O) for some a given by the oracle. Hence 
/(o;) < /(O) — 1 and we conclude that / has a minimum value by induction hypothesis. 

Now we give the formal proofs, which are natural deduction trees, decorated with terms of Tci^ss, as 
formalized in |[T1. We first prove that V?i. {Lessef{n) Hasminf) — > {Lessef{S{n)) — > Hasminf) holds. 

[Notlessf{S{n))] [Lessf{S{n))] 
Ep\\ln.Notlessf{S{n))\/Lessf{S{n)) Di D2 



Epn:Notlessf{S{n))\/ Lessf{S{n)) Hasminf Hasminf 

D ; Hasminf 



XwiD : Lessef{S{n)) — > Hasminf 



Xw\Xw2D : {Lessef{n) -^Hasminf) — !> {Lessef(S{n)) -^Hasminf) 
XnXwiXw2D : yn{Lessef(n) — >■ Hasminf) — > [Lessef{S{n) — > Hasminf) 

where the term D is looked at later, D\ is the proof 

vi :Notlessf{S{n)) W2 : Lessef{S{n)) 
(vi,W2) : Notlessf{S{n)) ALessef{S{n)) 



{S{n), {vi,W2)) ■ Hasminf 

and D2 is the proof 

[X2 : f{z) < Sjn)] 
X2 ■■ f{z) < n 

wi : [Lessef{n) -^Hasminf] {z,X2) '■ Lessef{n) 
V2 '■ [Lessf{S{n))] wi{z,X2) : Hasminf 

Wi (7roV'2, TTi V2) : Hasminf 

We prove now that Lessef{0) — Hasminf 

xi ■■ [/(z) < 0] 



xi:f{z)^0 



x,:f{a)>f{z) <a:fiz)<fiz) 



Xaxi : Notlessf{f{z)) (z,0) : Lessef(f{z)) 
(Aaxi, (z,0)) : Notlessf{f{z))Messef{f{z)) 
w : [Lessef{Q)] {f{z),{Xaxi,{z,(d))) : Hasminf 

{f{now),{XaniW, (tTow,©))) : Hasminf 
F :— Xw{f{7low), {Xaniw, (now,®))) : Lessef{Q) Hasminf 
Therefore we can conclude with the induction rule that 

Aa" RF{XnXwiXw2D)a : \/x.Lessef{x) -^Hasminf 



And now the thesis: 
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: /(O) < /(O) Aa" RF{?in?iWiXw2D)a : \/x.Lessef{x) -> Hasminf 

{0,9) :Lessef{f{Q)) RF{XnXwiXw2D)f{0) : Lessef{f{0)) ^Hasminf 
M := KF{XnXwiXw2D)f{0){0,(i) : Hasminf 

Let us now take a closer look to D. We have defined 

D:=ifXpS{n) then wi (<!>?%), 0) else {S{n),{Xp (Addp)%)j3,W2)) 

Let J be a state and let us consider M, the reaUzer of Hasminf, in the base case of the recursion and after 
in its general form during the computation: RF(AnAwiAw2i))/(O)(m,0)[j]. If /(O) = 0, 

M[s] = RF{XnXwiXw2D)f{0){0,(l))[s] = 

= F(O,0) = (/(O),(Aa0,(O,0)) 
If /(O) = S{n), we have two other cases. If XpsS{n) = True, then 

RF{XnXwilw2D)S{n){m,(l))[s] = 

= {XnXwiXw2D)n{RF{XnXwiXw2D)n){m,Ql)[s] = 
= RF{XnXwiXw2D)n{^p{S{n)),&)[s] 

If XpsS{n) = False, then 

RF{XnXwiXw2D)S{n){m,il))[s] = 

= {XnXwiXw2D)n{B.F{XnXwiXw2D)n){m,Q))[s] = 

= {S{n),{Xp (addp>5(n)i3,{m,0))) 

In the first case, the minum value of / has been found. In the second case, the operator R, starting from 
S{n), recursively calls itself on n; in the third case, it reduces to its normal form. From these equations, we 
easily deduce the behavior of the realizer of Hasminf. In a pseudo imperative programming language, 
for the witness of Hasmin f we would write: 
n:=/(0); 

while iXpsn = True, i.e. 3m such that f{m) < n G 
do n\=n — \; 
return n; 

Hence, when /(O) > 0, we have, for some numeral k 

M[s] = {k,{Xp {addp)skp,{(ppsk,(l)))) 

It is clear that k is the minimum value of /, according to the partial information provided by s about /, 
and that f{(ppsk) < ^. If 5' is sufficiently complete, then k is the true minimum of /. 
The normal form of the realizer M of Hasminf is so simple that we can immediately extract the winning 
strategy fi) for the 1-Backtraking version of the Tarski game for Hasminf. Suppose the current state of 
the game is s. If /(O) = 0, Eloise chooses the formula 

NotlessfiO) ALessefiO) 

and wins. If /(O) > 0, she chooses 
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Notlessf{k) ALessef{k) = Va /(a) > ^ A 3a /(a) < k 

If Abelard chooses 3a /(«) < k, she wins, because she responds with f{(ppsk) < k, which holds. Sup- 
pose hence Abelard chooses 

Va f{a)>k 

and then /(j3) > k. If it holds, Eloise wins. Otherwise, she adds to the current state s 

(Aj3 {a&&p)sk^)^ = {addp)skl5 = {/(jS) < k} 
and backtracks to Hasminf and then plays again. This time, she chooses 

Notlessf{f{li))ALessef{f{l5)) 

(using /(j3), which was Abelard's counterexample to the minimality of k and is smaller than her previous 
choice for the minimum value). After at most /(O) backtrackings, she wins. 

Coquand's Example. We investigate now an example - due to Coquand - in our framework of 
realizability. We want to prove that for every function over natural numbers and for every a G N there 
exists X G N such that f{x) < f{x + a). Thanks to the minimum principle, we can give a very easy 
classical proof: 

[Notlessf(p) ALessef{n)] 
Not less f(ll) 

f{z + a)>H [f{z)<H] 

f{z)<f{z + a) 

[Notlessf(ji) ALessefin)] ^f(x) < f(x + a) 

Lessefin ) WaBxf{x) <f{x + a) 

Hasminf ya3xf{x)<f{x + a) 

VaBxf{x)<f{x + a) 

The extracted realizer is 

Aa(;ib;ri;riM,;ib;riM(;ib;ri;riM + a) ikJ TriTTiTTih) 

where M is the realizer of Hasminf. m := ttq^i niM[s] is a point the purported minimum value /i := TioM 
of / is attained at, accordingly to the information in the state s (i.e. f{m) < /i). So, if Abelard chooses 

3xf{x)<f{x + a) 

Eloise chooses 

f{m) <f{m + a) 

We have to consider the term 

U[s\ := TiQniM{noK\Ti\M + a)m TiiTiiTiiM[s\ 

which updates the current state s. Surely, n\n\n\M[s\ = 0. 7ro^iM[s] is equal either to Aj3 (addp)j/ij3 or 
to Aa0. So, what does U[s] actually do? We have: 

U[s] = ;ib;riM(;io;ri;riM + a)[j] = 7ro7iiM{m + a)[s] 
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with either 7lo^iM{m + a)[s] = or 

noniM{m + a)[s] = {f{m + a) < f{m)} 

So U[s] tests if f{m + a) < /(m); if it is not the case, Eloise wins, otherwise she enlarges the state 
s, including the information f{m + a) < f{m) and backtracks to 3xf{x) < f{x + a). Starting from the 
state 0, after A: + 1 backtrackings, it will be reached a state s', which will be of the form {f{{k+ \ )a< 
f{ka),. . . J{2a) < f{a),f{a) < /(O)} and Eloise will play f{{k+l)a) < f{{k+l)a + a). Hence, the 
extracted algorithm for Eloise's witness is the following: 

n := 0; while f{n) > f{n + a) don:=n + a; return n; 

5 Partial Recursive Learning Based Realizability and Completeness 

In this section we extend our notion of realizability and increase the computational power of our realizers, 
in order to be able to represent any partial recursive function and in particular, we conjecture, every 
recursive strategies of 1-Backtracking Tarski games. So, we choose to add to our calculus a fixed point 
combinator Y, such that for every term u:A^A,Yu = m(Yu). 

Definition 16 (Systems PCFdass and PCpLeam ) We define PCFcte and PCFi^„„ to be, respectively, the 
extensions of Tc;a„ and Tij,„„ obtained by adding for every type A a constant Ya of type (A ^ A) A 
and a new equality axiom Yam = uiy ^u) for every term m : A — > A. 

Since in PCFdass there is a schema for unbounded iteration, properties like convergence do not hold 
anymore (think about a term taking a states s and returning the largest n such that Xpsn = True ). So 
we have to ask our realizers to be convergent. Hence, for each type A of PCFdass we define a set ||A|| of 
terms u : A which we call the set of stable terms of type A. We define stable terms by Ufting the notion 
of convergence from atomic types (having a special case for the atomic type S, as we said) to arrow and 
product types. 

Definition 17 (Convergence) Assume that {jijigN is a w.i. sequence of state constants, and m,v G 
PCFq„„. 

1. u converges in {si}i^^ if there exists a normal form v such that Bi^j > /•M[5j] = v in PCFi^„„. 

2. u converges if u converges in every w.i. sequence of state constants. 

Definition 18 (Stable Terms) Let {si}i^^ be a w.i. chain of states and j G S. Assume A is a type. We 

define a set \\A\\ of terms t € PCFcta of type A, by induction on A. 

L ||S|| = {t :S\t converges} 

2. ||N|| = {f : N I ? converges} 

3. ||Boo1|| = {t : Bool | t converges} 

4. ||A xB|| = {? : A xB I tiqI € ||A||,;/ri? G ||B||} 

5. ||A = {t: A ^ B \ \/u e \\A\\,tu G ||B||} 
Ift G ||A||, we say that t is a stable term of type A. 

Now we extend the notion of realizability with respect to PCFdass and PCFLeam. 
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Definition 19 (Realizability) Assume s is a state constant, t € PCFc/o,., is a closed term of state 0, A € =Sf 
is a closed formula, andt € 1||A|||. Let7= ti,. . . ,tn : N. 

1. t llhs P(t) if and only ift[s] = in PCfi^^„, implies P(t) = True 

2. t Whs A /\B if and only ifjlot llhs- A and Kit llhv B 

3. t \\\- s Ay B if and only if either pQt[s\ = True in PQ^Leam and pit llh^ A, or pQt[s\ = False in PCFl„,™ 
and p2t B 

4. t \\\-sA^ B if and only if for all u, ifu llh^ A, then tu \\\-sB 

5. t llhy \/xA if and only if for all numerals n, tn llh^ A[n/x\ 

6. t lll-,v 3xA if and only for some numeral n, 7lot[s] = n in PCF^^,,, and K\t Wt^ A[n/x\ 
We define t\\\- A if and only ift llhs- A for all state constants s. 

The following conjecture will be addressed in the next version of this paper: 

Theorem 5 (Conjecture) Suppose there exists a recursive winning strategy for player one in \Back{TA). 
Then there exists a term t ofPCfciass such that f llh A. 

6 Conclusions and Further work 

The main contribution of this paper is conceptual, rather than technical, and it should be useful to under- 
stand the significance and see possible uses of learning based realizability. We have shown how learning 
based realizers may be understood in terms of backtracking games and that this interpretation offers a 
way of eliciting constructive information from them. The idea is that playing games represents a way of 
challenging realizers; they react to the challenge by learning from failure and counterexamples. In the 
context of games, it is also possible to appreciate the notion of convergence, i.e. the fact that realizers 
stabilize their behaviour as they increase their knowledge. Indeed, it looks like similar ideas are useful 
to understand other classical realizabilities (see for example, Miquel 191). 

A further step will be taken in the full version of this paper, where we plan to solve the conjecture 
about the completeness of learning based realizability with respect to IBacktracking games. As pointed 
out by a referee, the conjecture could be interesting with respect to a problem of game semantics, i.e. 
whether all recursive innocent strategies are intepretation of a term of PCF. 
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